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Abstract 

Glassy systems are characterized by an extremely sluggish dynamics without any simple sign 
of long range order. It is a debated question whether a correct description of such phenomenon 
requires the emergence of a large correlation length. We prove rigorous bounds between length and 
time scales implying the growth of a properly defined length when the relaxation time increases. 
Our results are valid in a rather general setting, which covers finite-dimensional and mean field 
systems. 

As an illustration, we discuss the Glauber (heat bath) dynamics of p-spin glass models on 
random regular graphs. We present the first proof that a model of this type undergoes a purely 
dynamical phase transition not accompanied by any thermodynamic singularity. 

1 Introduction 

A broad class of liquids show a dramatic slowing down of their dynamics as the temperature is lowered 
(or the density increased) without any simple sign of long range order setting in [1]. In particular, 
the static structure factor S{k) = {Pf:p_j^ is hardly distinguishable from the one of a liquid, even if 
the system has become a solid from a dynamical point of view. We shall generically refer to systems 
displaying analogous phenomena as 'glassy'. 

The above features are reproduced in mode-coupling theory, as well as in some mean-field models, 
where the slowing down is promoted to a real dynamical phase transition without any simple static 
signature [2]. In both these theories, only short range correlations are taken into account, thus leading 
to the idea that they are the only responsible for the slow dynamics. 

Physical commonsense suggests however that, in systems with finite range interactions, only co- 
operative effects on large length scales may lead to a large relaxation time. This expectation has 
been recently substantiated by introducing a 'dynamical' correlation length in terms of a (dynamical) 
four points correlation function O HI [3 O [7j (see [51 [9] for recent developments). In this paper we 
confirm this line of thought through a rigorous argument. We define a time scale r, and length scale 
i appropriate for glassy systems and prove that they satisfy the inequalities 

Ci^ < r < expjCa^'^} , (1) 
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where Ci^2 are two constants to be specified below, and d is the system spatial dimension. This 
implies, in particular, that glassy systems are characterized by a large length scale, increasing as the 
temperature is lowered. The solution to the conundrum is that ^ is defined in terms of point to set 
instead of point to point correlations (which are probed, for instance, in scattering experiments). It 
is worth mentioning that the definition of i is purely statical (it does not depend on the dynamics, 
as long as this satisfies a few conditions), and that it is closely related to the Gedanken experiment 
discussed in ilOll . 



The physical intuition behind ([T]) is elementary. The lower bound follows from the observation 
that in order for the system to relax, information must be spread through at least one correlation 
length, and this cannot happen quicker than ballistically. For the upper bound, one argues that, 
without much harm, the system can be cut into boxes of size £, and that, within each box, the 
relaxation time cannot be larger than exponential in the volume. 

Both these arguments apply to a much larger family of models than the ones defined on d- 
dimensional lattices. Here we consider systems on general factor graphs with bounded degree (see 
Section [2T] for a definition). In this case, the factor in the upper bound of Eq. ([T]) must be replaced 
by the volume of a ball of radius i in the graph. There are two motivations for considering such 
generalizations. On one hand, sparse random graphs allow to define mean field theory, while retaining 
some features (a bounded number of neighbors and a locality structure) of finite dimensional models. 
On the other hand, they appear in a variety of random combinatorial and constraint satisfaction 
problems, ranging from coloring [H] to i^-satisfiability [12]. In this context, one recurrent question is 
whether configurations (or solutions) can be sampled efficiently using the Markov Chain Monte Carlo 
method. Understanding the relaxation time scale for Glauber dynamics is relevant for this question. 
In this context, point to set correlations and their relation to relaxation times were first considered 
in [13] and subsequently studied in [14]. In the following pages, we shall analyze one prototypical 
example of such system: the p-spin model (also known as XOR-SAT) on random regular graphs. 

The relation between correlations in space and time (or spatial and temporal mixing) is indeed a 
well established subject in probability and mathematical physics, with many beautiful results (see for 
instance [151 ITB] and references therein) . There are several reasons for presenting a new variation on 
this classical theme: (i) Most of the mathematical literature deals with translation invariant systems 
on finite dimensional lattices, {ii) The focus there is on 'global' characterizations of the space and 
time correlations, such as the spectral gap of the Markov generator, or the log-Sobolev constant. 
These are hardly accessible to experiments or numerical simulations, and in fact are not considered in 
the physics literature. {Hi) The dichotomy between vanishing and not-vanishing gap (or log-Sobolev 
constant) which is crucial there, is not that important for glasses. In many such systems, the gap 
vanishes well above the glass transition due to Griffiths singularities, or to the existence of metastable 
states that are not relevant when the dynamics is initiated with a random initial condition, (iv) 
The spatial mixing hypothesis used in mathematics requires decay of correlations uniform over the 
boundary conditions. This is probably too restrictive, especially in models defined on sparse graphs, 
for which the boundary of a domain can scale like its volume. 

The treatment presented here is essentially self contained, and based on elementary combinatorial 
arguments (in part inspired by [I7j). In Section [2] we provide our definitions of length and time 
scales and state precisely our main results. In the same Section we discuss qualitatively possible 
generalizations. As an illustration of the main results we consider in Section H] the p-spin glass 
model on random graphs. In order to carry on our analysis, we prove the first rigorous bounds 
on the relaxation time of a model in this class. In particular, these imply the occurrence of a 
purely dynamical phase transition as the temperature is lowered. Section [3] contains a few alternative 
definitions of length scale and a discussion of their equivalence. The proofs of the main results are 
contained in Sections [5] and [U with the most technical parts relegated in a series of Appendices. 
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2 Definitions and main result 



2.1 General graphical models: Equilibrium distribution 

Factor graphs [18] are a convenient language to describe a large class of statistical mechanics models. 
A factor graph G = {V, F, E) is a bipartite graph with two types of vertices: variable nodes (also 
called sites in the following) V B i,j,k,... and function nodes F B a,b,c, . . . . Edges are ordered 
pairs (i,a), with i £ V and a £ F. The number of variable nodes in G will be denoted by = \V\, 
and we shall identify V = {1, . . . , N}. Given i £ V (respectively a £ F), its neighborhood di {da) is 
defined as the set of function nodes a (variable nodes i) such that {i,a) £ E. We assume that the 
graph has hounded degree, i.e. that \di\, \da\ < A for some A > 0. 

The distance dij between two nodes i, j £ V is the length (number of function nodes encountered 
along the path) of the shortest path joining i to j. Given a non-negative integer r and a node i £ V, 
the ball of radius r around i, Bj(r), is the subset of variable nodes j with dij < r. With a slight abuse 
of notation, Bj(r) will sometimes be the sub-graph induced by these nodes. Finally, given U C we 
let dU C V\U be the subset of variable nodes at unit distance from U. 

We deal with models with discrete variables, taking values in the finite set X. A configuration 
is a vector x = {xi, . . . ,xn), xi £ X. For A C y, we write xa = {xi : i £ A}. In order to lighten 
the notation we shall write Xr^i^r instead of xy\Bi(r) foi' the configuration outside the ball of radius r 
around i. 

A probability distribution over configurations is defined by introducing one compatibility function 
V'node for cach node in G. These are non-negative functions, with -0^ : X'^°' — > M at function nodes 
and tjji : X ^ a,t variable nodes. We define 

tJ'ix) = ■^'[\ll^a{xda)'[\A{Xi) ■ (2) 

Compatibility functions can also be used to define conditional probabilities. Let C/ C y and F{U) C F 
be the subset of function nodes having at least one neighbor inU. If y = (yi, . . . , yj\f) is a configuration, 
we define 

l^u(^) = -^ n ^a{xda)YlMxi) , (3) 

if X coincides with y on = V\U (i.e. xjjc = yijc) and fJ-fjix) = otherwise. The normalization 
constant Zfj ensures that Ylx l^ui^) ~ ^■ 

Since we allow for vanishing compatibility functions (hard core interactions), the above expressions 
could be a priori ill defined. In order to avoid this, and to simplify our treatment, we shall restrict 
ourselves to permissive interactions (the definition we give here is very close to the one of [IZ])- This 
means that for each site i there exists x* £ X such that, given any non-empty J7 C if Xj = x* for 
each i £ U, then the right hand side of Eq. ([3]) is strictly positive regardless of the assignment of x 
on dU. In particular, this must be the case when the subset is a single vertex U = {i}. We denote by 
/iQ > a lower bound on the conditional probability for the corresponding state to be x*. In other 
words, we ask that /U^(x*) > fiQ for all i and y. 

2.2 Glauber dynamics 

The dynamics is specified as a single spin flip, continuous-time Markov process, irreducible, aperiodic 
and satisfying detailed balance with respect to the distribution ([2]). More precisely, for each variable 
node i, a set of transition rates Kf{^) > 0, with ^ £ X and Kf{^) = 1 is specified. Each variable 
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node i € V is associated to a clock, whose ringing times are distributed according to independent 
rate-one Poisson processes. When the clock at site i rings, a new value ^ & X is drawn from the 
distribution Kf (^), x being the current configuration. The new configuration x' coincides everywhere 
with X but on i, where x[ = ^. In order to verify detailed balance, the following condition must be 
satisfied by the transition rates: 



for any two configurations x and x' that coincide everywhere but on i. 

In our treatment we shall make two assumptions on the transition rates. 

• Locality. The transition rates n^i-) depend on the current configuration x only through xj, 
with dij < 1. Although its precise form could be modified (and somewhat relaxed), this is a 
crucial physical requirement: the Markov dynamics must be local with respect to the underlying 
graph G. 

• Permissivity. Let x* the state permitted at node i regardless of the configuration at 
Then, there exists kq > such that K,f (.x*) > independently of i and x. This assumption 
can somewhat be relaxed at the expense of some technical difficulties. 

A well known example of transition rates satisfying these conditions is given by the so-called 
'heat-bath rule': 



It will be also useful to define the Markov dynamics on a subset of the vertices U Q V, with 
boundary condition y G . By this we mean that the initial configuration agrees with y on V\U, 
and that variables outside U are 'frozen': each time the clock rings at a site i € V\U, the configuration 
is left unchanged. 

We shall sometimes use the notation ( • ) for averages with respect to the equilibrium distribution 
fi or the Markov process defined above, with initial condition distributed according to We also 
denote by ( • averages with respect to the subset in U with boundary y (i.e. cither with respect 
to the distribution /x^ or with respect to the Markov process on U with initial condition distributed 
according to fi^). 

2.3 Length and time scales and their relation 

Both our definitions of length and time scales depend on a parameter e. This should be thought of as 
some fixed small number (let's say 0.01), that provides a cut-off for distinguishing 'highly correlated' 
from 'weakly correlated' degrees of freedom. The physical idea is that, near a glass transition, the 
order of magnitude of the resulting length and time scales should be roughly independent of e as 
long as this is smaller than a characteristic value (the 'Edwards-Anderson' or 'ergodicity breaking' 
parameter) . 

Consider a vertex i in G. Let f(xi) be a function of the variable at i, and F{xr^i^r) a function of the 
variables whose distance from i are larger than r. The covariance {f{xi)F{x^i^r)) — {f{xi)){F{Xr.^i^r)) 
measures the degree of correlation between the observables / and F. In order to quantify the degree 
of correlation of Xi and x^i^r, it makes sense to consider the 'most correlated' observables and define 



Kfix'^ = nix') nf (xi) . 



(4) 



(5) 



Gi{r) = sup {f{Xi)F{x^i^r)) - {f{x,i)){F{x^i^r)) 



(6) 



f,F 
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where the sup is taken over all functions such that \ F(xr^i^r)\ ^ 1 for any x. The correlation 

length £j(e) of vertex i is defined as the smallest integer i such that Gi{r) < e for all r > i. In 
formulae 

= min{^ > s.t. Gi{r) < s y r > £} . (7) 

If no such i exists, ^i(e) is set by convention to the maximum distance from i of a vertex in G. 
The time scale definition is completely analogous to the above one. We let 

Q(0 =sup|(/(x,(0))/(x,(0)) - (/(x,(0)))(/(x,(t)))| , (8) 
/ 

the sup being taken over functions of the variable at i such that < 1 for any Xi. Then we let 

Ti{e) = inf{T > s.t. Gi{t) < e V t > r} . (9) 

This expression is well defined, and the resulting Tj(e) is always finite. In fact, if we define C/ (t) = 
{f{xi{0))f{xi{t))) — {f{xi{0))){f{xi{t))), the spectral representation of the transition probabilities [19], 
implies C({t) = Er=2 /). Here < A2 < A3 < . . . are the eigenvalues of the Markov 

generator, the Bi are non-negative coefficients, and n is the number of configurations of the system. 
In consequence this correlation function is positive, decreasing and has a vanishing limit when t — > 00. 

Both definitions given here admit several essentially equivalent variants which can be helpful 
depending on the circumstances; we shall discuss some of them in Sec. [31 We can now state our main 
result, whose proof can be found in Sees. [5]and[6l 

Theorem 1. Under the hypothesis presented in this Section 

Ciii{\X\V2^) <n{e) <l + Aexp{G2\Bi{ii{e/2))\} , (10) 

where A = log(|), Ci = l/2eA^, C2 = — log(Ko(l — e""'^)), and the lower bound holds under the 
assumption ii{\X\V2e) > log2(2/e). 

Let us stress that Ref. [T3] (Theorem 1.5) proves a very general result that is closely related to the 
above lower bound. Despite the fact that both the statements and the proofs are quite similar, we 
think that our formulation can be better suited for physics applications. The authors of [I3j provide 
in fact a bound on the spectral gap, a quantity that is hardly accessible in experiments or simulations. 
Further it does not correspond to an interesting time scale when studying the glass transition. In 
fact, the relaxation time defined as the inverse spectral gap becomes exponential in the system size 
well above the dynamical glass transition (roughly speaking, at the appearence of the first metastable 
states). This can be shown, for instance, in the example of Section [H 

2.4 Problems and generalizations 

Let us briefly discuss Theorem [1] and a few related research directions. As shown in Section HI this 
theorem can be only marginally improved in the general setting described above. It would therefore 
be interesting to determine additional conditions under which the upper/lower bounds are in fact 
closer to the actual correlation time. 

Recently there has been a considerable interest in a 'dynamical' length scale ^4 defined through 
four point correlation functions [9j. It would be interesting to understand whether a general relation 
holds between ^4, and the length £ defined in Eq. ([7]). Within mean field models undergoing a 
discontinuous glass transition, it has been argued that they are in fact closely related [201 121j . 
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The notion of 'growing length scale' is useful also when the initial condition for the Markov 
dynamics is not drawn from the equilibrium measur^ Of particular interest is the case of a 

uniformly random initial condition x G (a 'quench from infinite temperature'). The definition 
([7]) can be generalized to this setting if the expectations in Eq. ([6]) are taken with respect to the 
measure at some time t > 0. The resulting length will depend on time: ii{£,t), and is expected to 
increase with t, starting from £j(e,0) = until it reaches the equilibrium value £i{e,oo) = ii^e). The 
disagreement percolation technique used to prove the lower bound in Theorem [1] can be generalized 
to this case, cf. Section [5l to show that ii{e,t) <Ct. 

Finally, it would be interesting to consider more realistic models for glasses, e.g. off-lattice particle 
systems with Langevin dynamics. We guess that similar ideas to the ones proposed here can be useful 
in that case, although at the expenses of several technical difficulties. 



3 Alternative definitions of length scale 

The basic physical idea in the definition of length scale is to look at correlations between a point (a 
vertex) and the whole set of variables at distance larger than r from it. Equation Q provides one 
measure of these correlations. Here we shall define four alternative measures Gf\r), n = 1,...,4. 
For any such measure, we can define a length scale exactly as in Section 12. 3^ 

4"^(e) = min{£ > s.t. Gj"^(r) < e V r > £} . (11) 

The question which we shall address shortly is to what extent these definitions are equivalent. 

The first two definitions express the idea that two random variables are weakly correlated if their 
joint distribution is (approximately) factorized. With an abuse of notation we denote by fi{xi,x^i^r) 
the joint distribution of Xi and x^i^r, and by fJ-{xi), nix^^i^r) their marginal distributions. Then we 
define 

Gf^(r) = sup ^ |^(x^i,r.|Xi) - /i(x^j,rk-) , (12) 

G'f^(^) = X] ^fJ,{Xi,X^i^r) - Kxi)fJ-ix^i^r) ■ (13) 

Xi fX r^i_^f 

One inconvenient of the definition of Gi{r), cf . Eq. ([6]) , as well as of G ■ (r ) and G (r ) is that they 
are difficult to evaluate. Equation ([6]), for instance, requires the optimization with respect to F which 
is, in general, a function of Q{N) variables. Given a function / of the variable at i, we let / be the 
function of Xr^i^r obtained by taking the expectation of / with respect to the conditional distribution 
corresponding to the boundary condition x^^i^r outside Bj(r). In formulae: f{x^i^r) = {f{xi))B {r)- 
We then define 

(r) = sup {f{x,mx^i,r)) - {f{xi)){Kx^^,r)) , (M) 

/ 

where, again, the sup is taken over the functions / such that |/(xj)| < 1 for any x. Note that one can 
always decompose an arbitrary function /(xj) in terms of the \X\ indicator functions f^{xi) = I(xj = 

^), hence (r) can be computed with a finite number of covariance estimations. The correlation 

function G ■ (r) was already used in [14] to bound the spectral gap of Ising and hard core models on 
trees. 



^The question was posed to us by Giulio Biroli. 
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A suggestive interpretation of the last definition is provided by the following procedure. Generate 
a reference configuration x according to the distribution fi, and evaluate f{xi) on it. Then 'freeze' 
everything is outside Bj(r) and generate a new configuration x'^ inside, according to the conditional 
distribution /ig ^^,) (i.e. with the boundary condition given by the frozen variables). Evaluate f{x'^ on 

(3) 

the new configuration. The correlation function G- (r) is (the sup over / of) the covariance between 
f{xi) and f{x'j). This procedure was indeed discussed in [TO]. 

A last definition consists in considering the mutual information [22] between Xi and Xr^iy. 

Gf\r) = I{Xi-X^,^r). (15) 

Recall that, given two discrete random variables X and Y with distribution p{x,y), their mutual 
information is defined as I{X;Y) = y P(-^^ y) p{x)p{y) • ^ pleasing interpretation follows from 
the general principles of information theory. Suppose that a configuration x is generated according 
to the distribution jj, but only Xr^i^r is revealed to you. In a very precise sense, G['^\r) measures how 
much information you would have about Xj. 

It turns out that the length scales extracted from G^^^(r), . . . , G\'^\r) convey essentially the same 
information as ii{£). More precisely, they differ only by a redefinition of the parameter e and a 
rescaling. 

Proposition 1. Let ^•"^(e), n = 1, ... ,4 be defined as in Eq. ill]), and /U* = min^^.g^ fJ-^Xi). Then 



if\\X\e)<Ue) <£f\e), (16) 

i?\e)<e^\e)<if\f,.e/2), (17) 

£?\\X\V~e) < if {e) < Ue) , (18) 

4^)(^/2i) <#)(.)< 4^) (^). (19) 



Under the permissivity assumptions, we have ^* > > and the changes of argument in the 
above expressions amount therefore to a finite rescaling in e. The proof of this proposition can be 
found in App. |Al 

4 A mean field example: the j9-spin model on random (hyper)graphs 

In this Section we discuss a particular example exhibiting glassy behavior. Our objective is twofold. 
First of all, we want to check to what extent Theorem [1] can be improved over. Second, we want to 
show that the idea of diverging correlation can be of relevance even for mean field systems, as soon 
as the underlying factor graph is sparse. 

We consider a model of Ising spins a = (ui, . . . ,a]\f), G {+1,-1} (for historical reasons, we 
use here Uj instead of Xj to denote the i-th variable), defined through the Boltzmann distribution 

(20) 

M 

X]<^aO-n(a) •••%(a) • (21) 
a=l 

Here we think of i G {1, . . . , N} as the variable nodes of a factor graph G, and of a G {1, . . . , M} 
as its function nodes. Further, we assume G to be a random factor graph with degree / at variable 



1 



-f3E{a) 



m 

M 

E{a) = -J^JaH^^ 

a=l iPda 
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nodes, and p at function nodeqj. A random factor graph from this ensemble wih also be referred 
to as a 'random /-regular hypergraph' (function nodes being identified with hyperedges joining the 
neighboring variable nodes). Finally, /? = 1/T is the inverse temperature and the Jq's are i.i.d. 
random variables, uniform in {+1, — !}• 

In the following it will be always understood that p,l > 3. It is in fact expected that for p = 2 
the model undergoes a spin glass transition without a dynamical phase transition, and for / = 2, no 
phase transition at all occurs at finite temperature. 

As the variables crj can only take two opposite values, the general definitions given in Sec. 12.31 
simplify somewhat. For instance 



The phase diagram of this model is expected to be characterized by two phase transitions [23]: 
a dynamical phase transition at temperature T^, and a statical one at Tc < (see also ^201 [21] 
for a detailed study of the dynamics in the case of uniformly drawn random hypergraphs) . At high 
temperature T > T^, Glauber dynamics is fast and the relaxation time r does not depend on the 
system size A^. Note that, in the thermodynamic limit, any finite neighborhood of any spin is a regular 
hypertree with high probability. Therefore as far as the r^'s are finite, they converge in probability 
to a deterministic value r. 

Below Td, the Tj's become instead exponentially large in N. Heuristic estimates on the exponen- 
tial rate can be expressed in terms of free-energy barriers, computed from the so-called quenched 
potential [Ml |25j . In [26j , it was shown that the spectral gap is exponentially smalU at low enough 
temperature for some values of p and I. 

Remarkably, the free energy remains analytic in the interval (Tc,oo), and a true thermodynamic 
phase transition takes place only at Tc. If p > I, Tc = 0, and there is no thermodynamic phase 
transition. 

Assuming that this picture is correct (below we shall partially confirm it). Theorem [1] implies that 
the correlation lengths ii are finite for T > T^. On the other hand, for T < T^, the £j's are necessarily 
divergent in the system size. Since the size of a ball of radius r is bounded as |Bj(r)| < l{p — \Y{l — lY, 
a moment of thought shows that I > C log N. On the other hand, £ cannot be larger than the graph 
diameter, whence i = O(logA^). 

In the high temperature phase T > Tj, the correlation function G^^\r) can be computed recur- 
sively, using a non-rigorous approach that exploits the locally tree-like structure of the factor graph. 
Let us sketch the procedure here and refer to |2H [27] for a detailed description. We consider a rooted 
tree factor graph with R generations T^,{R) defined as follows. For R = 0, T*(i?) is a single node 
(the root). For any R>0, one first defines J^:{Ry by joining (/ — 1) copies of J^{R) at the root, and 
then joins {p — 1) copies of T^<(i?)' to a common function node a of degree p to obtain T^{R + 1). 
The remaining variable node adjacent to a is the new root. One then generate a configuration a on 
this graph according to the Boltzmann weight ()20p . 'freezes' it from generation r on, and consider 



^More precisely G is distributed according to the corresponding configuration model. To sample a factor graph from 
this ensemble, each of the A*' variable nodes is attributed I sockets, and each of the M function nodes, p sockets. The 
Nl = Mp sockets on the two sides are then matched according to a uniformly random permutation over Nl elements. 
Multiple edges are removed, if they occur an even number of times, and replaced by a single edge, in the opposite case. 

^One interesting feature is that the temperature Tgap below which the gap becomes exponentially small, is higher 
than Td. In the temperature range < T < Tgap, the slow modes correspond to metastable states that are 'not seen' 
if the dynamics starts from a random or equilibrated initial condition. 




(22) 
(23) 
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the conditional fii^ri ■ ) = /^(<7j = • Ic^i^r), i being the root node. When Ur^i^r is generated randomly 
according to the above procedure, • ) can be considered as a random variable. Knowing its dis- 
tribution allows to compute G^^\r) (it turns out that this distribution does not depend upon R). 
Thanks to the tree structure, a recursive distributional equation can be written for • ), and solved 
numerically using a sampling ('population dynamics') technique. 

In Fig. [H left frame, we plot the results of such a computation for p = I = 3 and a few values 
of T. As the temperature is lowered towards Td, G^^\r) develops a plateau of length diverging 
as (T — T(j)~^/^. As a consequence, for any e smaller than the value of G^^^{r) on the plateau, 
(T — Td) In order to check whether the lower bound of Theorem [1] is optimal, we 

estimated the time r from Monte Carlo simulations of the heat bath dynamics on large {N = 10^) 
samples. We averaged over several samples and checked that Tj is approximately independent of i. 
Our data are presented in the right frame of Fig. [1] and compared with the lower bound of Theorem [1] 
with i^^^ evaluated via the recursive method. The correlation time turns out to diverge algebraically 
at Td: r ~ (T - Td)"^. Fitting the data, we get 7 « 3.2 > 1/2. 

In Fig. [2] we plot the exponential growth rate of the correlation time obtained by extensive nu- 
merical simulations on small {N = 100) systems (see also [28] for an analogous study in the fully 
connected limit). As above, we consider here the case I = 3, p = 3. This is compared with a rigorous 
lower bound stated in Proposition [2] below, and with the upper bound obtained from Theorem (H 
using |Bi(r)| < A^. 

The above discussion of the behavior of correlation times was largely based on the analogy with 
the fully connected spherical model [2], and an on heuristic arguments. Here we confirm rigorously 
several elements of this picture. For the sake of definiteness, we shall consider Glauber dynamics with 
the heat bath rule 

<(ai) = ^ (l + aitanhphiia)) , (24) 

where hi{a) = Ylaedi '^aY\j^Qa\i'^j 'local field' acting on the i-th. spin. Then we have the 

following result, whose proof is deferred to Appendices ICl and iDl 

Proposition 2. Let Tj(e) be the correlation time for spin ai in a p-spin model on a random l-regular 
hypergraph with p, I > 3. Let T^f^ = (arctanh(l//(p — 1)))"-*^, and T^Y^ ' '^pT^ temperatures 
defined in Appendix [Dl 

IfT > T^J^, then Ti{e) < (l//t) log(l/(Ke)), where k = (1 - l{p - l)tanh^). 

If < T < Tp'l" , then there exists constants and T > such that, for any 1/4 > (5 > 0, 
^ e^t^^'^l for at least N{q^ — 6 — e) spins with high probability. 

The analysis in Appendix [D] also provides rather explicit expressions for the temperatures T^^f" , 
^ well as for g^, and T. The numerical values of some of these constants are reported in Tabled] 
for a few values of p and I. 

We are now in position to discuss to what extent Theorem[T]can be improved (here we focus on the 
large i, large r behavior) without loosing in the generality of its hypotheses. In the high temperature 
phase r ~ {T — T^)~'^ , while £ ~ (T — Td)"^/^, and the analogy with fully connected models suggests 
7 > 1 quite generically [2]. Therefore we expect that the lower bound can be improved at most to 
r > £^ with, probably, C = 2- 

At low temperature r = exp{G(A^)} while Theorem [T] implies r < expjCA^}. This is optimal 
apart from a possible improvement in the exponential rate. Our conclusion is that, without further 
assumptions on the system. Theorem [T] can be improved at most to CiCj < ti < exp{C2|Bj(£)|}. 

Let us finally observe that Proposition [2l together with Lemma H] imply that the model (|2ip 
undergoes a purely dynamical phase transition. This is the first time such a behavior is proved for a 
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p 


/ 


^ ann 


^barr 


Td 


Tfast 


3 


3 





0.470124 


0.510 


5.944027 


3 


4 


0.854138 


0.687684 


0.753 


7.958158 


3 


5 


1.113214 


0.849507 


0.935 


9.966577 


4 


3 





0.376808 


0.410 


8.962840 


4 


4 





0.575513 


0.625 


11.972171 


4 


5 


0.771325 


0.724693 


0.785 


14.977751 



Table 1: Various characteristic temperatures for the p-spin model on random /-regular hypergraphs: 
Tann is an upper bound on the static transition temperature T^, is the dynamical transition obtained 
by a cavity calculation; Tbarr and Tfast are, respectively, lower and upper bounds on T^. 




Figure 1: Numerical results for the 3-spin model on random 3-regular hypergraphs at high tempera- 
ture. Left: the spatial correlation function G\ (r), from left to right T = 0.6, 0.53, 0.52, 0.515. Right: 
correlation times. Symbols are obtained by Monte Carlo simulations of the heat bath dynamics, the 
solid line is the rigorous upper bound of Proposition[2j the dashed line corresponds to the lower bound 
from Theorem [TJ 

100 I 1 1 1 1 1-1 



0.01 



+ . 



0.0001 I 1 1 1 >— ' 

0.25 0.3 0.35 0.4 0.45 0.5 

T 

Figure 2: Correlation times for the 3-spin model on random 3-regular graphs at low temperature. 
Here we plot T = (logr)/A^ versus temperature (r being uniform average of Tj over the site i). The 
continuous line is the rigorous lower bound from Proposition [2l The dashed line is the upper bound 
from Theorem [TJ Symbols correspond to numerical results from Monte Carlo simulations. 
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model of this family (Glauber dynamics on a sparse graph spin model). In finite dimensional models, 
we expect that such a sharp dynamical transition cannot occur. Nevertheless, it would be interesting 
to understand whether the correlation time of glassy systems generically undergoes a crossover from 
a polynomial (r ~ to an exponential (r ~ exp{0(^^)}) relation with the correlation length. 

5 Proof of the lower bound in Theorem [1] 

For pedagogical reasons we first recall some definitions and well-known facts from probability theory, 
that will be repeatedly used in this and the following proofs. Consider a finite set S and two probability 
measures p^^\ p^'^^ defined on it. The total variation distance between these two measures is defined 
as 

l^|p(l)(^)_p(2)(^)| . (25) 

X 

One easily checks that this defines indeed a distance, in particular it vanishes if and only if p^^-* = p^'^\ 
and that the two following characterizations are equivalent: 

IIp^^^ -P^^^IItv = max|pW(r) -p^^\T)\ = 1 - Y,P(^) ' (26) 

X 

where we defined p{x) = mm[p^^\x),p^'^\x)]. 

Another very useful form of the total variation distance is defined in terms of couplings of the two 
measures, that is to say joint distributions X2) whose marginals are p*-"^^ andp^^^: X]x2 ^(■^i' ■^2) = 
p^^\xi) and 9(2:1, X2) =p^'^\x2). 

Considering the random variable {Xi,X2) £ S'^ drawn from such a coupling, one can easily prove 
that 

||p('^-p(')||Tv<P[^i/^2] . (27) 
Moreover one can construct an optimal (or greedy) coupling that achieves the bound. We thus have 

I \P^^^ - P^^^ I Itv = min F[Xi / X2] . (28) 

We can now start the proof of the lower bound in Theorem [T] (a presentation in a restricted setting 
is provided in [21]). This is based on a 'disagreement percolation' argument first used in [29] (for 
recent applications, see [HI [30]). More precisely, for a given variable node i and a positive integer r, 
we construct a Markov process {x^-^\t) , x^'^^t)) on x in the following way: 

• at the initial time t = 0, a configuration x{0) is drawn from the law fi, and imposed to the two 
trajectories, x^^\0) , x^'^\0) = x{0). 

• Each variable node owns an independent rate-one Poisson clock. When the clock at j rings, say 
at time t: 

— if J ^ Bj(r), x^^\t) is replaced by ^, drawn from kJ* x^^^ remains unchanged. 

— if j G Bj(r), we draw (^^1,^2) from the greedy coupling between and and 
replace (xf (t), xf (t)) by (d, 6)- 
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{x^^\t) , x^'^\t)) is a coupling of two Markov processes: viewing them separately, x^'^\t) is the original 
dynamics we are interested in, whereas x^^\t) is the dynamics on Bj(r) (cf. Sec. 12.21 for a definition) 
with boundary condition x^j^r(O). 

Consider now a function f{xi) with |/| < 1. Conditioning on x^j^r.(0), we can apply the observation 
stated in Sec. 12.31 on the dynamics of x^Q^^j.^{t) to obtain 

f{x^^AO)f < (/(4'^(0))/(4'^(i)))-f • (29) 
Averaging over x^i^ri^), this yields 

{f{xM)hx^^Am < {fi^^l\0))f{xl'\t))) . (30) 

We now introduce the indicator function 

!(*) = (' «-S"W-fW , (31) 
I otherwise 

in terms of which we can rewrite the above inequality as 

(/(x.)/(x.,.)) < (/(xf (0))/(xP(t))I(t)) + (/(xW(0))/(xP(t)) {1-im 

< (/(xP(0))/(xf)(t))) + 2(l-I(t)) . (32) 

In the last step we used the fact that / is bounded in absolute value by 1. 
A disagreement percolation argument implies the bound 



a-iW)<^;:^J • (33) 

Indeed, (1 — I(i)) is the probability that x^^^ and x*-^-* disagree on the variable node i at time t. 
The coupling between x^^^ (t) and x^'^^ (t) has been defined in such a way that at initial time the two 
configurations agree on all variable nodes, and that a disagreement on a node j S Bj(r) can appear 
when j is updated only if at least one of the neighbors of j already bears a disagreeing assignment. 
In other words, disagreement has to appear in F \ Bj(r) and propagates towards i. More formally, let 
us call a disagreement path a an ordered list of variable nodes a = {ir, ■ ■ ■ ,ii,io = i) such that two 
successive nodes are adjacent in G, and with duf, = k. A path a is said to percolate if there exists 
a sequence of times < ti < • • • < tr+i < t such that the clock of the node ik rings at time tr+i-k- 
The quantity (1 — I{t)) is thus bounded by the probability of the event "at least one disagreement 
path has percolated", which is itself smaller than the product of the number of such paths by the 
probability pr+i{t) for a given path to percolate. As a given clock rings in the infinitesimal time 
interval [tk,tk + dt] for the first time since tk-i with probability e~^*'=~*'=~"^^(it, 

ft ft ft °° —tj-S 

Pr+i{t) = / dtie-'^ / dt2e-(*^-*^)... / dt,+ie-(*'-+i-*^)= V ^ (34) 

JO Jtl Jtr g^^j^i ^■ 

< 7—-rTT< -—r] ■ (35) 



(r + 1)! ~ Vr + 1 

The number of disagreement paths can be bounded by A^C'^+i), hence Eq. §S\ 
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Subtracting (/(xj))^ from both sides of Eq. ([32]) . and taking the supremum over |/| < 1, we obtain 



Gf\r)<Cm + ^ 



A 2j. \ '■+1 



r + 1 



Setting t = Ti{e) and calhng = max(2eA^rj(e), log2(2/e)), we have 



Gf\r)<2e W > , 



(36) 



(37) 



hence an upperbound on £f\2e), which can be translated into the form stated in Theorem [T] using 
the inequahties of Proposition [TJ □ 



6 Proof of the upper bound in Theorem [T] 

The upper bound is proved by viewing the dynamics inside Bj(r) as the dynamics of a 'reduced' 
model whose degrees of freedom are only the ones in Bj(r), and on which the exterior acts as a time- 
dependent boundary condition. On one hand, time correlations inside the system decay in a finite 
time (since the system is finite). On the other, if r is large enough, the time-dependent boundary 
condition does not affect the behavior in the center of Bj(r). 

Let us begin by defining the Markov dynamics on a subset of the vertices U O V with time 
dependent boundary conditions {y{t)}t>o- This means that one is given a sequence of times to = < 
ti < ■ ■ ■ < tn < ■ ■ ■ with t„ — > oo as n — > oo, and of configurations y^, yi, . . . . The chain is initialized 
in a configuration x distributed according to . In each time interval [tn,tn+i)-, n = 0, 1, . . . , one 
runs the chain with boundary condition y^- Then, at time tn+i, the configuration outside U is changed 
from yn to Vn+i- Averages with respect to this process will be denoted by ( • )^^. 

It is convenient to state separately the following estimate on time decay of correlations for this 
dynamics (for the proof, see App. [B]) . 

Lemma 1. Let f and g he two functions of x ^ , such that |/(a;)|, \g{x)\ < 1 for any x. Then 

{f{xmg{x{t)))\f^ - (/(x(0)))g^>(<7(x(t)))Jf>| < 2e-(*-i)/-^ , (38) 

where = exp{A|C/|}, and A = — log(Ko(l — e~^)). 

Let us now turn to the actual proof. Fix a vertex i, a positive integer r, and consider a function 
f of Xi, with |/(xi)| < 1 for all Xj's. By conditioning, we can write 

'{f{xmf{x^m^f] , 



(/(x,(0))/(x,(t)))=E|,} 



(39) 



where ^{y} denotes expectation with respect to the process {y{t)}t>o distributed according to the 

for (•)b^(^)- Lemma [1] implies 



(stationary) Markov chain on G, and we used the shorthand ( 
(/(x,(0))/(x,(t))) < E|,| [(/(x,,(0)))g>(/(x,,(t)))g> 



+ 2e-(*-i)/^*(^''-) , 



(40) 



where T*{i,r) is a shorthand for ''"b (r)' '^^^ expectation on the right hand side can be simplified by 
conditioning on the initial condition y{0): 



E 



{y} 



{f{^^m)lf{f{xim^f 



■y(o) 1 ^{y{t>o)} 



%o)|(/(^^(0)))rf E 



(/(^.(0)))l?(/(x.(t)))g.^|y(0)]} 
{yit>o)} 



(41) 

(42) 
(43) 
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where we defined 

(44) 

Since / is uniformly bounded by 1, \F{xr^i^r)\ ^ 1 for all x as well. Moreover one easily shows 
that {F{x^i^r)) = ifixi)). Subtracting (/(xi))^ from Eq. (gO]) and using the definition of the spatial 
correlation function, cf. Eq. ([6]), we obtain 

C( (t) < Gi{r) + 2 e-(*-i)/-*(^''-) . (45) 

Taking the supremum over / and setting r = li(e/2) implies the upper bound of Theorem[TJ □ 



i^(y~i,r(o)) =E, 



yio) 



E 



{yit>o)} 



{fix^mW\ym] y~.r(o) 
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A Proof of Proposition [1] 

In this appendix we denote simply by x the variable Xi, by y the 'far apart' variables x^^i^r and by 
n{x,y) (respectively fi{x), ^{y)) their joint distribution (respectively, marginal distributions). Finally, 
we omit the arguments i and r from the correlation functions Gi{r), G^^\r), ...G^^\r). Proving 
Proposition [T] amounts to deriving the following inequalities between these functions: 

1 I 

G(2) < < — , 

1^(2)' <G(4) <(--l\ 

2 - ) 

Proofs of similar statements can be found repeatedly in the literature. We refer in particular to 
[31] for a general presentation, and to [32j that deals with the tree reconstruction problem which is 
closely related to the theme of this paper. We collect nevertheless the proofs here for the sake of 
self-containedness. Notice that the upper bounds in Eqs. ([^7|) and (|39|) become trivial if /x* = 0. We 
shall therefore assume, without loss of generalityj/i* > 0. 

(ji6]) . lower bound. In the definition of G, take|j/(x) = I(x = x=k), and F{y) = sign[/i(y|a;*) — ^(y)]. 
Then 

G>Y^ f{x)F{y)fi{xMy\x) - fi{y)] = fi{x,) \fi{y\x,) - fi{y)\ . (50) 
x,y y 

*We denote by t{A) the indicator function for the property A. 
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(46) 
(47) 

(48) 
(49) 



The thesis follows by choosing which maximizes the last expression. 
(j46]) . upper bound. We have 



G = sup 



f{x)F{y)[n{x, y) - /i(x)/i(y)] 



x,y 



<Y\Hx,y) -/i(x)/i(y)]| , 
x,y 



(51) 



that is what was claimed. 

(|^7|) . lower bound. We have 



Yti{x')[fi{y\x) - /i(2/|x')] 



<Y,Kx)Kx')Y.Hy\x) - KyW)\ , (52) 



And the last expression is upper bounded as sup^ Yly \fJ'{y\x) — fi{y\x')\ = G^^\ 
()T7|) . upper bound. We start by noticing that 

q(2) = ^pt(x)E|/i(y|x) -/x(y)| > sup | |//y|x( • k) - ^y( • )| |i , 



(53) 



where we introduced the standard notation for norm and used subscripts to precise which vari- 
able we are considering the distribution of. By triangular inequality ||^y|x(' l^^i) — ^y|js:(" 12^2)1 |i < 
2 sup^, ||/iy[x( ■ l^;) ~ t^vi • )||i for any xi, X2- The thesis follows by taking the sup over xi, X2- 
I8|). lower bound. Take f{x) = I{x = x^), and therefore f{y) = /i(x*|y). Then 



x,y y 
By maximizing the right hand side over x*, we obtain 



(54) 



(55) 



> 



1 



[li{x,y) - ^l{x)^^{y)f > 



Y\fi{x,y) - fi{x)fi{y)\ 



. x,y 



(56) 



9]), lower bound. We notice that I{X; Y) = D{p\\q) where p and q are distributions on the pair 



where the last step followed from Cauchy-Schwarz inequality. 
I8|). upper bound. Trivial: take F = f. 



z = {x,y) defined hy p{x,y) = n{x,y) and q{x,y) = fi{x)n{y), and D{p\\q) = Y^^p{z)log[p{z)/q{z)]. 
Defining px{z) = (1 — X)q{z) + Xp{z), by elementary calculus 



D{p\\q) 



(z)VdX. 



(57) 



By Cauchy-Schwarz (applied to the scalar product with weight l/px{z)) we have Z^z p^(z) ^P^' 

q{z)]'^ > {J2z \p(.^) ~ q{^)\)'^j ai^d the thesis follows. 

I9|). upper bound. We use p\{z) > (1 — A)g(z) in Eq. ([57|) . and get 

G^'^<Y.^^[p{z)-q{zf< sup P^'^ 



(58) 



The thesis follows by noticing that p{z)/q{z) = fi{x\y) / fi{x) < 1//U*. 
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B Proof of Lemma [T] 



Lemma [T] is a well known elementary result for Markov chains with time-independent boundary 
conditions (see for instance [15] for a functional-analytic argument). We present here an independent 
and self-contained proof for the general case. We start by restating it in a slightly stronger form. For 
this purpose, we need to define the dynamics on [/ C 1/, with time-dependent boundary condition 
t > 0} and generic initial distribution u. This is defined exactly as the process with time- 
dependent boundary condition introduced in Sec.[6l but for the fact that the initial state is distributed 
according to z/, instead of /x^^'^^ It is understood that v{x) = unless xy\jj = y{^)v\u- 

Lemma 2. Let and vf^ the distributions at time t for the dynamics on U ^V, with the same 
time- dependent boundary condition t > 0}, and initial distributions (respectively) , z/^^^ and u^'^\ 

Then 



,(1) „(2) 



where = exp{^|C/|}, and A 



,,Tv < e"^'""^'''^llz^^ 
log(Ko(l-e-i)). 



1/(2)11 

M TV ) 



(59) 



Proof: Denote by {x^°'\t)^t > 0}, a G {1,2} the two processes. We construct a coupling of 
these two processes, similar to the one of Sec. [5) 

• at the initial time t = 0, x("'^^(0) and ^(^-'(O) are drawn from the greedy coupling of v^'^'^ and i/^^-*, 
hence P [x^^\Q) + x(2)(0)] = - z/(2)||^^. 

• When the clock at j £ U rings, say at time t, we draw {^1,^,2) from the greedy coupling between 
K^- and and replace {x^j^\t) , x^^\t)) by (^1,^2)- 



Consider now two times t and t' = t + At > t. If the two processes coincide at a given time, the 
definition of the coupling implies that they will coincide at all subsequent times. Therefore 



(l)(t') / x(2)(t')] = {1 - P [x(l)(t') = x(2)(t')|x(l)(t) / x(2)(t)] } P [x(l)(t) / x(2)(t) 



(60) 



The conditional probability appearing in the last expression can be lower bounded by the probability 
of a particular event implying x^^\t') = x^'^\t') irrespective of x^^\t), x^'^\t). The event is defined 
as follows. Each variable in U tries at least one flip during the time interval [t, t') (this happens with 
probability (1 — e"'^*)'^'). Furthermore, the last time a flip is attempted on each of the spins, it 
brings the two coupled processes to coincide on it (this happens with probability at least kq for each 
spin). We have therefore 



x(i)(t') = x^^\t')\x^^\t) + x(2)(t)j > [ko(1 



AtMlt^l 



(61) 



Finally consider the time interval [0, t\ and split it into [t/AtJ sub-intervals of size At (plus, eventually 
a sub-interval of smaller size). By repeatedly applying Eq. ()60|) . recalling that (1 — x) < e~^ and 



V? - '^f ^IItv < P {x^^\t) + x(2)(t)], we get 



,(1) „(2)| 



<exp|-Ml-e-^*)]l^l [^J} 



.(1) 



V TV 



(62) 



The thesis is proved by taking At = 1, and noticing that \ t\ >t — \. 



□ 
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We next show that this result implies Lemma [TJ We have 

(/(x(0))5(x(0))Jf^ - (/(x(0)))ff>(5(x(t)))fr> 



^P{x(0) = x}f{x)g{x') [F{x{t) = x'\x{0) = x} - ¥{x{t) = x'}] 

< P{x(0) = x} ^ |P {x{t) = x'|x(0) = x} - P {x{t) = x'} I 



< 



(63) 
(64) 
(65) 



The sum over x' in the last expression is 2||f^^^'' — f^^^''||TVi for two processes of initial conditions 
i/W(^x') = I{x = x') and v^"^^ = fJ-^^^ ■ The proof is completed by applying Lemma [2l with — 



C High-temperature upper bound for the j9-spin model 

In this Appendix we prove the first part of Proposition [2 at high enough temperature, the correlation 
time rj(e) is, with high probability, finite. 

We begin by recalling that the temperature P^f^ = l/T^^^i^ appearing in the statement of Propo- 
sition [2] is defined as the largest value of /? such that 



{p-l)l tanh^ < 1, 



(66) 



We shall also use the notation p{a, r) to denote the Hamming distance (number of different spins) 
between two configurations a and r. The proof makes use of the following crucial result. 

Lemma 3. Consider the p-spin model, cf. Eqs. I120\) and I121\) . at inverse temperature [3 < P^f^, 

and let fi'^ he the Boltzmann measure, conditioned to ai = ±1. Then there exists a coupling of fi^^ 

and such that, if {a, r) is a pair of configurations distributed according to such a coupling, their 
expected Hamming distance is bounded as 



l-l{p- l)tanh/3 



(67) 



Proof: Without loss of generality, we set i = 0. 



Given a coupling v between /i^^ and fJ-^^\ and a pair of configurations a, r distributed according 
to ly, let Pj(z^) be the probability that aj ^ tj under v. By definition, poiu) = 1. We claim that, given 



j V, j 0, it is possible to construct another coupling u' between /i^"* and fi^^' in such a way that 



,(0) 



Pj {ly') <tanh f3 E Pfc('^), 

Pk{iy') = Pk{v) V/c / j 



(68) 



We shall denote in the following Tj the mapping v' = Tji^. The coupling ly' can be defined through 
the following sampling procedure. First sample a and r from z^. Then draw a'j, rj by coupling in a 
greedy fashion the conditional distributions jj,{aj\ar^j), and /^(^jk-^j) (to be explicit, we shall denote 
these conditional distributions as /ij(-|c7^j), and /^j(-|T^j) in the following). Finally set o"^ = 
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and r'f, = for all nodes k j, and define z/' to be the joint distribution of a' and r'. Obviously 
Pk{^') = Pk{i^) for all k ^ j. Moreover 

Pji"') =^'^i(^^T)\\Hi- k'-i) - Mj( • k~i)||Tv • (69) 

cr,r 

Notice that - depends on a^^j only through (Tq = {cr^ : G da\j}, with a € 9j. Denote by 

ai, . . . ai the indiceqfl of function nodes which are neighbors of j and define (T^*\ t = 0, . . . , / in such a 
way that aa] = eras for s < i and ai*'' = Ta^ for s > i (in particular a^^^ = r and cr^'^ = a). Then 

I 

Pjii^') < E^(^'^)Eii/^^(-i^S)-/^^(-i^~7'^)ii-< (70) 

cr,r t=l 

< ^z.(a,T) /T„Jtanh/3 = tanh/?^P^(a„, /t„J. (71) 

cr,r t=l t=l 

Here we used the fact that 



lMj( • kS) - /^i( • ki*.- ^^)IItv - ^ 



tanh I /? 5^ Ja n 4*M - tanh /? Ja J] ^i* 

agOj k£da\j j \ aGc'j k^da\J 



.(72) 



and I tanh a — tanh6| < 2tanh(|a — b\/2). Applying the union bound to Eq. ()71l) . we get Eq. ()68l) . 
We shall now construct another mapping v' = Tu by combining the elementary Tj. Defining 
and ordering arbitrarily the variable nodes 1,...,A^ — 1, we construct recursively the 
couplings v^^\ . . . , z^(^~^) with 

z.W= J"'^ '-^'"^ ^ , fori = l,...,iV-l . (73) 

I otherwise 

Finally we let = Tv. A moment of thought shows that 

Pj(Tz.) < tanh/? ^ Pk{u), (74) 

a&dj k£da\j 

for all j 7^ 0. 

To conclude the proof, denote by p(i^) the expectation of p{a, r) when u and r are distributed 
according to the coupling z^, which can also be rewritten as p(i^) = Pj{^)- By summing over j 7^ 
Eq. (j74p . recalling that po(^) = 1 and that each node k is the neighbor of at most l{p — 1) nodes j, 
we get 

p(Ti^) < 1 + /(p - 1) tanh/3p(z/) . (75) 

Since l{p — 1) tanh/? < 1, a coupling achieving (I67p can be constructed by iterating a sufficient 
number of times the transformation u ^ Tv from an arbitrary initial coupling. In fact the sequence 
of distributions T'^v admits a subsequential limit because the space of distributions over a finite set 
is compact. □ 



^Within the configuration model is possible (although with probability vanishing as A'^ ^ 00) that the variable node 
j has less than / neighboring function nodes. Although the proof remains valid in this case, we shall not consider it 
explicitly in order to lighten the notation. 
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Notice that the above proof is in fact closely related to the proof of Dobrushin uniqueness condi- 
tion [23j as described, for instance, in [34j. For arguments of this type we also refer to [35j. 

We are now in position of proving the upper bound in Proposition [2l Consider two processes 
{cr^~^\t)}t>o and {cr^~\t)}t>o evolving according to the Glauber dynamics with initial conditions 
distributed accordingly to (for a^~^\0)), and /U^'* (for (t^~\0)). In other words, {cr^~^\t)} (re- 
spectively {cr^~\t)}) is the stationary Glauber dynamics conditioned to a^^\o) = +1 (respectively, 
CTj- ^(0) = — 1). Then it is easy to show that 

a{t) = i(l - mf) [{al-^\t)) - {at\t))] , (76) 

where rui = {cTi) is the expectation of ai with respect to the (unconditional) Boltzmann measure. 

Given an arbitrary coupling of the two processes {a'^^\t)^, and {a^^\t)^, the correlation function 
is obviously bounded as 

Q(t)< (p(aW(t),cT(-)(t))) . (77) 

We construct such a coupling as follows. The initial conditions (t(+)(0), and a^^\Q) are chosen 
according to a coupling of the conditional distributions that achieves the bound (j67p . The joint 
dynamics is defined using 'path coupling' [361 I19j . This construction only requires to define the 
evolution of a^~^\t) and a^~\t) when they differ in a single position. If this is the case, we use 
the greedy coupling of the update probabilities, as in the proof of Lemma [H cf. App. [Bl Prom a 
pair of configurations (cr^+^cr^^)) with p[a^~^\a^~^) = 1, the expected (with respect to the greedy 
coupling) Hamming distance after one spin update can be upper bounded by 1 — (k/N), with n = 
(1 — l{p — 1) tanh /?). Standard path coupling arguments allow to extend this bound for an arbitrary 
number of spin updates. Let us denote by Ut the number of spin flips between times and t (i.e. a 
Poisson variable of mean Nt), and by E^/ the corresponding expectation: 

a{t) < (p (a(+)(0),a(-)(0)))Et; (l - < exp{-Kt} , (78) 

which clearly implies the thesis. □ 



D Low-temperature lower bound for the p-spin model 

In this Appendix we prove the second part of Proposition [2] on the correlation time of the p-spin model 
on random regular hypergraphs. Throughout the Appendix we denote by (5o-,t = Yld=i ^i'^i the 
normalized overlap of configurations a and r. We further let Z{P) be the partition function, i.e. the 
normalization constant in Eq. ()20p . and Z{q;P) the constrained partition function 

-/3£;(cr(i))-/3£;((T(2)) . 

7(1) .,7(2) 



where q G {-1, -1 + 2/N, . . . , 1 - 2/N, 1}. 

It is convenient to state a few preliminary results. We start by computing the expectation of Z{P) 
and Z{f3,q). It is straightforward to get 

E Z{p) = 2^(cosh/3)^ = e^<^(^) (80) 
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where we defined (/>(/5) = log 2 + - log cosh (3. The expected constrained partition function is only 
slightly more involved 



EZ(g;/3) = 2^ (^^^ coeff[((cosh2/3)p+(a;) +p_(2;))^^x(^-'^)'] = e^^^'^''^'^ (81) 

where we denoted by = identity to the leading exponential order, m = N{1 + q)/2, and p±{x) = 
(i+x)p±{i x)p ^ Using Haymann (or saddle-point) method, one obtains the exponential growth rate 

</>(g;/3) = log2-(/-l)/i(^i^) +^log((cosh2/3)p+(z,)+p_(z,))-/iy^logz, , (82) 

where h{x) = — xlogx — (1 — x) log(l — x) is the entropy function and Zq is the unique non-negative 
solution of the equation 

{cosh2(3)p'+{zg)+p'_{zq) _ 1-q 
^'{cosh2P)p+{zq)+p.{zq) ^ 2 • ^ ' 

The function (l){q; (3) is straightforwardly evaluated numerically. 

Let us underline some of its properties. One easily shows that for all temperatures, it has a 
local maximum at = 0, with 0(0;/?) = 2(j){[5). Moreover, for /3 = 0, the function reduces to 
'i^i.l] 0) = log2 -I- /i((l — (7)/2), for which q = Q is the global maximum. We define f3^™ as the largest 
value of (3 such that for any (3' < (3, (j){q; (3') achieves its global maximum at (7 = 0. Notice that, if 
p>l, then [3'^'^{^ = 00. 

Let us now turn to the behaviour around g = 1. A simple calculation shows that (j)[q = 1;/?) = 
4>{2I3). Let us define U{q;f3) = (j){q = 1; (3) — 4){q;j3), and U{q;oo) its limit as /? — > 00. Taking 
afterwards the limit g — > 1, one shows that U{1 — 6; 00) = — (/ — 2)(51og(5)/4 -|- 0{6). 

We define the annealed free energy barrier 

T(/3) ^ sup [(l-a)To(/?)-^(/?,a)], (84) 

aG{l/2,l) 

To(/3) = sup U{q;P), (85) 
((2 - a-i)/?) + (1 - a)^{2p) - 0(/3) . (86) 



a) = ac 



Prom the behaviour of U{q; 00) in the limit q ^ 1 one can deduce that, if / > 3, To(/3) is strictly 
positive for /3 large enough. On the other hand, a) is non-negative because </>(/?) is convex. From 
the low temperature expansion (/){f3) = (l — l/p) log2 -\- {I / p) (3 + 0{e~'^^) it follows that (j){P,a) — > as 
P 00. We proved therefore that, if / > 3, there exists /? < 00 such that T(/3') > for > /?. We call 
abarr ^^iq smallest (3 with such a property. The numerical estimation of the associated temperatures 
(inverse of /?) for a few values of {p, I) are given in Table [H 

These computations have several immediate consequences. The first is that they yield the free 
energy in the thermodynamic limit. 

Lemma 4. Consider the model i20\) . \21\) with p > 2> and I > 2, and let be given as above. 

Define the expected free energy density as (pNiP) = N~^'ElogZ{(3), and assume (3 < f3pY^- Then 
(pNiP) as N ^ 00. Furthermore, for any 6 > there exists C{6) > such that: 

¥{\logZ{P) - N(f>N{P)\ > N5} < 2e-^^(^) . (87) 



20 



Proof: The statement (Pn{(3) — > follows from the second moment method applied to the 
random variable Z{P) (notice that E Z{j3)'^ = exp{A^ sup^ (t){q; /?)})• Equation ([87l) can then be proved 
through standard concentration inequalities [37]. □ 



Lemma 5. Consider the model i20\). \21\) with p > 3 and I > 2, and let (j){q; (3) he defined as in 
Eqs. i82\). i83\) . Then, for any /3 > and any 6 > 0, and any N large enough: 

Proof: This is just Markov inequality applied to the random variable noting that 

lim{l/N)logEZ{q;P) = (^{q;P). □ 



Lemma 6. Consider the model \21\) with p > 3 and I > 2, and (3 < Let a^'^'> and cr*^^^ he two 

i.i.d. configurations drawn from the Boltzmann distribution /i20\) . and denote by Q12 be their overlap. 
Then, for any 6 > 0, there exist constants Ci and C2 > such that, for all N large enough 

P{KQi2)| > N5} < Ci(5)e-^^2(^) . (89) 

Proof: First notice that, by the two previous Lemmas, there exist constants Ci{6) and 02(6) 
such that, with probability at least 1-Ci (6) e'^'^'^^^^ , the following happens: (i) Z{q; 13) < e^[<^(9'/3)+'5'] 
for any q G {-1, -1 + 2/N, . . . , 1 - 2/N, 1}, and (ii) Z{j3) > e^t'^C'^)-'?']. Under these conditions, for 
any ^ > we have 

{Q12) < e + ip{iQi2i>e} = e + ^7^E^(«;/5) (9°) 

< ^ + e^N5'^ Yl exp{iV[0(g;/3) -2<^(/3)]} , (91) 
kl>e 

with the sum being restricted to q £ {— 1,— H-2/A^, ...,1 — 2/N, 1}. Recall that g = is a stationary 
point of (p{q', P) with 0(0, /3) = 2(/>(/?). A little calculus shows that (j)"{0;P) < for any p > 3. As a 
consequence for any T > T^y^ there exists a > such that (j){q; (3) < 2</)(/3) — aq^ . Therefore 

KQi2)| <e + 2A^e3^^'-^"«'. (92) 

By taking ^ = 25/y/a, we have |(Qi2)| < for any N large enough. The thesis follows by 

rescaling 6. □ 



The crucial step is made in the next Lemma, which estimates the global correlation function 

N 

N 



1 ^ 

C{t) = -Y.^am^^{t)). (93) 



i=l 



Lemma 7. Consider the model (2^), (MD with p>3, I >2 and P^f' < (3 < (3^^ . Let T = T{(3) be 
the associated annealed free energy harrier, andq^ be the largest value of q at which the sup in Eq. [8^) 
is achieved. Then, for any 5 G (0, 1/4], and t > {4/N) log{2/5), one has C{t) > q^ - 6 -te-^^^-^^ 
with high probability. 
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Proof: Consider two equilibrium trajectories {a^'^\t) : t > 0}, {a^^^t) : t > 0} evolving 
independently according to the stationary Glauber dynamics for the model (1201) . (I2ip . In particular, 
at any time a^^^ (t) and a^"^^ (t) are distributed independently according to the equilibrium distribution 
fi. We further let Q{t) = QaW{t),ai'^){t)- Throughout the proof, we denote by E and P, respectively, 
expectation and probability with respect to this process (not to be confused with expectation and 
probability with respect to the graph and energy function). 
It is elementary that 

Ci2t) = E [Q(t)k(i)(0) = a(2)(0) = a] . (94) 

where, for future convenience, we specified the temperature /? at which the Boltzmann measure 
must be considered. Let us denote by the event {a^^\0) = a^'^\0) = a}. Moreover, for any 
q £ {—1, —1 + 2/A^, . . . , 1 — 2/N, 1}, we denote by Aq^t the event that there exists a time s £ [0, t] 
such that Q(s) = q. Clearly 

E[Q{t)m = P{^g,t|IJE[Q(t)|/<„^,,j] + (l-P{^,,t|/,})E[g(t)|/^,:4,,i] (95) 

> -P{Ag,t\Ia}+q{l-P{Aq,t\Ia}) (96) 

= q - {I + q)P{Ag,t\Ia} . (97) 

Denote by Ut the total number of spin flips in the two configurations up to time t (this is a Poisson 
random variable of mean 2Nt). Then 

P{Ag,t\Ia} < P{Ag,t\I.,Ut<mt} + P{Ut>4m} (98) 
< P{Ag,t\I„,Ut<4Nt} + e-^'^\ (99) 

If we call U the event {Ut < 4:Nt}, we found 

C{2t)>q-2f^Y.^,f,{a)P{Aq,t\Ia,K} + e-""'/^^ 
Call F{a) = P{Aq^t\Ia,i^}, and choose a £ (1/2, 1). By Holder inequality 

Y,^,p{a)F{a) = Y,^2p{a) (j^^ F{a) (101) 

1;/^2/3(ct) \y^^,2p{a)F{a)\ (102) 



(100) 



< 



l-a 



Z(/3(2 - a-M)"Z(2/3)i-" V- . X . ^ , . 

= Z{P) |E/^2/3(^)F(a)| , (103) 

where, in passing from (jl02p to (|103p . we used the fact that F{a) < 1 and the definition of Z{(3). A 
moment of thought reveals that 

Y,^^2p{<y)F{a) = P{Aq,\Q{^) = IM] < ^^{^(0)1^^'^ (104) 

p{g(o) = i} - p{Q(o) = i} ^ ' 
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(the last inequality follows from the fact that P{Ut < ANt) > 1/2 for t > -^log2). Next notice that 
P{Q{t) =q} = Z{q; /3)/Z(/3)2, and P{Q(0) = 1} = Z{2p)/Z{pf. Putting the various terms together, 
we obtain 

C{2t) > q - 2e-^*/2 - 2(8jVt)^~" ^^^^^ ~ "'^2^^^^' • (106) 

Z{fj) 

Using Lemmas m and O the product of the partition functions can be upper bounded by exp[— A^((l — 
a)U{q;l3) — (/>(/?, a) — 6/2)], for any 5 > 0, with high probability. With the hypothesis of the lemma 
8Nt > 1, we can thus replace [8Nty~" by 8Nt in the inequality. Moreover this prefactor 8A^ will be 
smaller than exp[A^(5/2] for large enough N. We will also have 2e~^*/^ < S. Fixing q in such a way 
to achieve the sup in Eq. ()85p and optimizing over a £ (1/2, 1) completes the proof. □ 

We can now prove the low temperature lower bound in Proposition [2j Consider the sum of the 
local correlation functions 

N N 
1=1 i=l 

One has C{t) = C{t) — {Q12), where Qu is the normalized overlap of two i.i.d. configurations cr^^\ 
and cr^^) distributed according to the Boltzmann measure (j20p . (j2ip . Lemma [6] implies that, for any 
6 > 0, C{t) > C{t) — 6 with high probability. By Lemma [TJ we have therefore 

C(t) > -25-te-^[^-^] . (108) 

Fix = e^t^^^"^]. Then, for N large enough, we have C{t^:) > — 36. If f{e) is the fraction of sites 
i such that Ti{e) > t^, then C(i*) < /(e) + e, hence /(e) > q^ — 35 — e. The thesis follows by rescaling 
6. □ 
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